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Abstract — In a distributed storage environment, where the 
data is placed in nodes connected through a network, it is 
likely that one of these nodes fails. It is known that the use 
of erasure coding improves the fault tolerance and minimizes 
the redundancy added in distributed storage environments. The 
use of regenerating codes |1| not only make the most of the 
erasure coding improvements, but also minimizes the amount of 
data needed to regenerate a failed node. 

In this paper, a new family of regenerating codes based on 
quasi-cyclic codes is presented. Quasi-cyclic minimum storage 
regenerating (quasi-cyclic MSR) codes are presented and their 
existence with high probability is proved. Regenerating codes 
with optimal bandwidth constructed from a base MSR code by 
using graphs are also provided. These codes not only achieve 
optimal MBR parameters in terms of stored data and bandwidth, 
but also for a specific choice of the parameters involved, they can 
be decreased under the optimal MBR point. 

Quasi-cyclic regenerating codes are also very interesting be- 
cause of their simplicity and low complexity. They allow a repair- 
by-transfer in the optimal bandwidth case (no operations are 
needed) and a pseudo repair-by-transfer in the MSR case, where 
operations are needed only when the new node enters into the 
system replacing a lost one. 

I. Introduction 

The availability problem of stored data is an essential issue, 
which has been studied for a long time [2 |. The increasing use 
of cloud and distributed storage systems to a massive scale has 
changed the paradigm of data storage, where the amount of 
stored data must be minimized and the recovery from failures 
must be solved. 

The motivation for using erasure coding in distributed 
storage systems comes from the need to reduce the storage 
cost. They allow to achieve high reliability without requir- 
ing the increased storage cost that is associated with data 
replication scheme [3|. There are the obvious cost savings 
from purchasing less hardware to store the data, but there are 
significant savings from the fact that this also reduces data 
centers size, the power for running less hardware, among other 
savings J4). 

The use of erasure coding in distributed environments 
minimizes the amount of data stored in the system, but 
introduces what is known as the code repair problem 0: 
how to maintain the encoded representation when failures 
(node erasures) occur, minimizing the amount of stored data 
and the bandwidth used to repair one node. To maintain the 
same redundancy when a storage node leaves the system, a 
newcomer node has to join the system, access some existing 
nodes, and exactly reproduce the lost contents. 

The bandwidth used by the newcomer to reproduce the lost 
contents can become a problem. Regenerating codes |1| are 
designed to minimize the code repair problem maintaining 



the advantages of using erasure coding in distributed storage 
environments. Note that the storage size problem and the code 
repair problem are both consequences of using erasure coding, 
but they refer to different states of the system. The former 
refers to the amount of stored data required to maintain an i- 
fault tolerance system. The latter refers to the bandwidth used 
to regenerate a failed node. 

This paper is organized as follows. In Section [TH re- 
generating codes are explained and defined. In Section [HI] 
previous contributions on erasure coding in distributed storage 
environments are shown. In Section IIVI quasi-cyclic MSR 
codes are presented and analyzed. In Section [V] the case when 
bandwidth is minimal is studied, and new codes with minimum 
bandwidth are constructed and analyzed. Finally, in Section 
IVII the conclusions of this work are exposed. 

II. Regenerating Codes 

In this section, first, the code repair problem and how 
it can be minimized using regenerating codes is explained. 
Then, the necessary conditions and bounds for codes to be 
called regenerating are shown. Finally, a general construction 
of regenerating codes is presented. 

Let M be the size of a file. In a replication scheme, if each 
node stores a data units, with a <C M, the newcomer must 
download a data units to replace the lost node. However, in 
an erasure coding scheme, the same newcomer must download 
M data units to store only a. 

In IT], the code repair problem is presented, and a solu- 
tion to minimize the bandwidth used to repair one node is 
proposed. This solution consists of using regenerating codes. 
The main idea of this kind of codes is the use of a network 
multicast technique for data transmission optimization, called 
network coding (6), in conjunction with erasure codes. The 
regenerating codes use erasure coding to minimize the amount 
of stored data required and use network coding to minimize 
the bandwidth used to regenerate a failed node. 

A. Conditions and Bounds 

Let C be a [n, k, d] regenerating code, where n is the total 
number of nodes in the system; k is the number of nodes 
which contain the minimum amount of information necessary 
to reconstruct the file; and d is the number of nodes, called 
helper nodes, necessary to regenerate one failed node. If each 
helper node sends (3 data units, the repair bandwidth used is 
7 = (3d. 

There are two conditions that any regenerating code must 
satisfy: 
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with ft ^ 1. Therefore, we can assume that j3 = 1. Then, 

Amsr = d-k + 1, (1) 
BMSii = k(d-k + l), (2) 

in the MSR case, and 
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in the MBR case. 



Figure 1. Optimal tradeoff curve between a and 7 for a [10,5,9] 
regenerating code. 



• Data reconstruction condition: Any subset of k nodes 
must contain enough information to reconstruct the file, 

so ak > M. 

• Node regeneration condition: The regeneration of any 
node in the system must require less bandwidth than the 
total file size M, so 7 < M. 

The former condition refers to the state when a Data Collector 
(DC), that is, the user who is allowed to access the data, 
wants to reconstruct the file. The latter condition refers to 
the state when a node fails, then, a newcomer enters into the 
system to replace it, and d helper nodes send information to 
the newcomer. Both conditions are formally explained in 0. 

The tradeoff between storage per node a and bandwidth to 
repair one node 7 is shown in [ 1 1. Clearly, the most interesting 
case is to minimize both a as well as 7. However, when one 
of them achieves its minimum value, the other one is not able 
to achieve it. The optimal tradeoff between a and 7 can be 
represented by a curve, which leads to two interesting points: 

> The Minimum Storage Regenerating (MSR) point: Each 
node stores the minimum possible amount of data 
dMSR = 4r> an d then minimizes the bandwidth, which 

Md 



gives an optimal value of jmsr - k (d-k+i) - 
• The Minimum Bandwidth Regenerating (MBR) point: 
Each node can be repaired downloading the minimum 
possible amount of data jmbr — 2kd-tS+k ' anc * ^ en 
minimizes the storage, which gives an optimal value of 

_ 2Md 
OtMBR - 2klTW+k- 

Figure Q] shows the optimal tradeoff curve for a [10, 5, 9] 
regenerating code. The MSR point corresponds to the lowest 
a, while the MBR point corresponds to the lowest 7. 

We can assume that the data units are bits. Moreover, we 
can consider sequences of bits of length r, which can be seen 
as symbols from a finite field F 9 , where q = 2 r . Therefore, a 
file with M bits can be seen as a file with B symbols over 
¥ q , where M = Br; and each node with a bits can be seen 
as a node with A symbols, where a = Ar. 

As shown in Q, to simplify the system implementation, a 
divide-and-conquer technique can be used in the regenerating 
codes applied to large files. This means that, if we can 
construct a [n, k,d] optimal regenerating code with (3 = 1, 
then we can construct a [n, k, d] optimal regenerating code 



B. Construction and Complexity 

Most regenerating codes are constructed in the following 
way. First, the B — Ak data symbols of the file are stored 
in k nodes. Then, A (n — k) redundancy symbols are created, 
using an erasure correcting code, and are stored in the others 
n — k nodes. 

When a node in the system fails, it has to be regenerated, 
that is, replaced by a new one. There are three different types 
of repair that can be used to regenerate a failed node in the 
system. 

• Functional repair: the newly generated nodes can contain 
different information from that of the failed node, as long 
as the repaired system still achieves the data reconstruc- 
tion condition. 

• Exact repair: the failed nodes are exactly regenerated, 
thus restoring exactly the lost encoded blocks with their 
exact replicas. 

• Exact repair of the systematic parts: the nodes with data 
symbols are using exact repair, while the nodes with 
redundancy symbols are using functional repair. 

The exact repair is more complex than the non-exact repair 
|8|. However, it is more useful, since in this case the DC can 
always recover the B data symbols of the file without undoing 
linear combinations in the system. In other words, there are 
always k nodes with data symbols. 

Figure [2] shows the construction and regeneration process 
using a functional repair [4, 2, 3] MSR code, with A = 2 and 
B = 4. More specifically, these two processes can be described 
as follows: 

1) The file is fragmented into B = Ak = 4 data symbols. 
The B — Ak = 4 data symbols are stored in k = 2 
nodes, and the A (n — k) = 4 redundancy symbols are 
computed using erasure coding and stored in the n — k = 
2 remaining nodes. 

2) The regeneration of a failed node is functional and can 
be achieved by using network coding. 

Note that this is an example of a 2-fault tolerance system in a 
distribute environment. Moreover, in the regeneration process, 
to restore one node, it is only necessary to download 3 symbols 
instead of B = 4. 

The use of network coding introduces some computational 
complexity J9|, since it requires not only solving a system 
of equations of complexity 0(n 2 ), but also computing co- 
efficients to construct linear independent equations. These 
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Figure 2. The process of construction and regeneration using a functional repair [4, 2, 3] MSR code, with A = 2 and B = 4. 



coefficients are difficult to find, especially if we want an 
exact repair. Moreover, they must be computed for each 
different failed node. The new families of regenerating codes, 
introduced in Sections [IV] and [V] will allow an exact repair 
minimizing this complexity. 



C. Relation with error-correcting codes 

It is important to note that regenerating codes are strongly 
related to classic error-correcting codes. Actually, a [n,k,d] 
regenerating code can be seen as an error-correcting code of 
length n and dimension k, where each node corresponds to 
a coordinate of the codewords. Moreover, since each node 
contains A symbols over F 5 , which can be seen as one 
symbol over an extension field ¥ q A , the error-correcting codes 
associated with regenerating codes are codes over ¥ q a . 

From the data reconstruction point of view, a DC needs to 
be able to connect to any subset of k nodes to reconstruct the 
whole file. In order to assure this condition, the regenerating 
code can be constructed from a Maximum Distance Separable 
(MDS) code over F q A. Moreover, in this case, the use of 
a MDS code will also assure that the system will add the 
minimum possible amount of redundancy. In general, it is not 
necessary to use a MDS code to construct a regenerating code. 
However, if we focus on the MSR case, when the amount of 
redundancy must be minimal, the error-correcting code used 
to construct a regenerating code must be a MDS code. 

From the node regenerating point of view, in order to 
minimize the amount of download symbols to regenerate any 
node in the system, it is important to consider each node as 
a sequence of A symbols over ¥ q rather than one symbol 
over ¥ q A . In this way, it is possible to design the regeneration 
process by downloading just some of the A symbols over ¥ q 
contained in each node. 



III. Related Work 

Other approaches to the distributed storage problem have 
been studied, for example, using hierarchical codes ifTOl or 
self-repairing codes ifTTl . fl2l . In the former case, the subset 
of required helper nodes depends on the node that needs to be 
repaired, and these subsets of helper nodes may have different 
sizes. In the latter case, instead of using classic error-correcting 
codes as a black box, new codes which allow regeneration of 
one symbol using few of the other symbols are constructed. 
However, these codes need to have a very low transmission 
rate, k/n, in order to approach the optimality achieved by the 
regenerating codes. 

Regenerating codes have been, so far, the most popular 
approach to the distributed storage problem, because of their 
flexibility, their applicability, and their capacity to use error- 
correcting codes with high transmission rate, k/n. They were 
introduced in 2007 [13], [14], when it was claimed that, using 
network coding together with erasure coding in a distributed 
storage system, it is possible to decrease the amount of 
bandwidth needed to regenerate one node. The technique used 
led to some functional repair constructions of high complexity 

ih, ma. 

Despite the functional repair is the most simple approach 
when network coding is used, the exact repair, which was 
introduced in [16) and lfl7l . is shown to be better in a practical 
environment. Specifically, in ifTBI . it is showed that an exact 
repair is achievable for d = n — 1 at the MSR point. In 
IfTTl . the first explicit construction for d = k + 1 at the MSR 
point, as well as an exact repair construction for d = n — 1 
using graphs at the MBR point are given. Meanwhile, in [8|, 
a computational search is done, and some explicit functional 
repair codes for [n = 5,fc = 3,d = 4] are found. In [18|, 
optimal exact repair MDS codes for [n > 2k, k,d = k + 1] 
are given. All these constructions require a large file size and 
have a high complexity. A survey on regenerating codes has 
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been published in [19], where all different types of repair are 
deeply explained, and the main open problems in this field are 
discussed. 

The "MISER" code, which assures an exact repair of the 
systematic parts for parameters [n > 2k, k,d — n — 1], is 
presented in fl20l and 1211 . In these papers, it is also shown 
that it is impossible to construct an exact MSR code when 
d < 2k — 3. In J22), it is claimed that the nodes with the 
redundancy parts of the "MISER" code can also be exact 
repaired. The "MISER" code can also be represented within 
a more general framework, called "Product Matrix". This 
framework was introduced in Q, and it is able to provide 
explicit constructions for the MBR and MSR points for a wide 
range of parameters. 

In ll23l and l24l . another approach is presented. Unlike 
the approach using regenerating codes, where any subset of 
d nodes is able to regenerate a lost node, in this case, a lost 
node needs to be regenerated by an specific subset of d nodes. 
Therefore, the rules and bounds are different. 

Finally, the use of array codes, that is, regenerating codes 
that are MDS if an array of elements is treated as one 
element, has also become important. For example, the well 
known "even-odd" [25] array code has been an inspiration for 
researchers in this field. 

In this paper, new families of regenerating codes, called 
quasi-cyclic codes, will be introduced and studied in detail 
for the MSR and MBR points. In fact, they can be seen as 
exact repair array codes which use a simple cyclic construction 
to minimize both, the complexity of using network coding in 
the nodes, and the complexity of the algorithms needed to 
regenerate a failed node. Because of the use of an specific 
subset of d helper nodes for each failed node, it is possible to 
achieve, and in some cases to beat, the optimal values a and 
7 according to the tradeoff curve shown in [1|. Some results 
in this paper were presented in part at two conferences [26|, 
11221- 
There have been other approaches to the distributed storage 
problem using quasi-cyclic codes [28 1, which also points out 
that quasi-cyclic codes can be significant in the distributed 
storage field. 

IV. Quasi-cyclic MSR codes 

In this section, we will describe the quasi-cyclic MSR 
codes in detail. Specifically, in Subsection IIV-AI we will 
show how to construct them and some of their properties; in 
Subsection IIV-BI we will see how to regenerate a failed node; 
in Subsection IIV-CI we will prove their existence by showing 
that the data reconstruction condition is achieved; and finally, 
in Subsection IIV-DI we will describe an example of a [6, 4, 3] 
quasi-cyclic MSR code. 

A. Code Construction 

The [n, k,d] quasi-cyclic MSR codes will be composed of 
a set of storage nodes, denoted by {v\, V2, ■ ■ ■ ,v n }, and will 
satisfy the following two conditions: 

• d = k + 1: the number of helper nodes is k + 1, where 
k > 1. 



• B = n: the number of nodes is the same as the number 
of data symbols. 

Therefore, by equations (fl~|i and (|2j, they will also satisfy that 
A = 2 and B = n = 2k. 

In order to construct a [n, k, d] quasi-cyclic MSR code, 
given by the circulant coefficients (ci, . . . , c^), we can follow 
the next steps: 

1) Fragment the file into B = n data symbols a = 
(a\, ci2, ■ ■ ■ , a n ), where 6 ¥ q for i = 1, . . . , n. 

2) Construct n redundancy symbols r = (ri, T2, . . . , r n ) 
by 



k+i 

c j-i a j 



1, 



(5) 



where q e ¥ q \ {0} for I = 1, . . . , k and j = i + 
1, . . . , k + i mod n. 
3) Store (ai, ri) in the node Vi for i = 1, . . . , n. 

Let S be the set of all subsets of {1, . . . , n} of size k. Let 
D be an n x n matrix over F 9 and let s — . . . , i^} 6 S. 
Let Z)W denote the i-th vector column of D and D s denote 
the n x k submatrix of D given by the k columns determined 
by the set s. 

Let F — (I\M) be a n x 2n matrix, where / is the n x n 
identity matrix, and M is a n x n circulant matrix M defined 
as follows: 



M = 



( o 

ci 



Cfe 





V o 



Cfc-l 
Cfe 



Cl 
C2 



Cfe 









Cfe-l 
Cfe 










Cfe Ck-l Ck-2 



C2 




o J 



(6) 

where ci 6F ? \ {0} for I = 1, . . . , k. The matrix F represents 
the system. Each row corresponds to one data symbol, and 
each node is represented by two columns, one from / and one 
from Al. Actually, the node Vi, which stores (a^r^), is also 
given by 

(a i ,r i ) = (aJ«,aM«). 

Note that the data symbols are represented by the identity 
matrix /, while the redundancy symbols are represented by 
the circulant matrix M. 

Circulant matrices have been deeply studied because of 
their symmetric properties ||29l . Moreover, F can be seen as 
a generator matrix of a double circulant code over ¥ q 1 30 1 . 
Double circulant codes are a special case of quasi-cyclic codes, 
which are a family of quadratic residue codes. 

Figure [3] shows the construction of a quasi-cyclic MSR code. 
First, the file is split into n symbols over ¥ q . Then, these 
symbols are encoded using F and producing 2n symbols. 
Finally, each two symbols are stored together in one node. 
As stated in Subsection III-CI two symbols over F 9 can be 
seen as just one symbol over the finite field ¥ q 2 . 
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Figure 3. Construction process for a [n, k, d] quasi-cyclic MSR code. 

B. Node Regeneration 

In this subsection, we will show how to regenerate a failed 
node minimizing the required bandwidth. Actually, we can just 
follow the next algorithm: 

1) Download the data symbols from the next k nodes in 
the sequence. Note that due to the circulant scheme, the 
next node of v n is v\. From these data symbols, the 
redundancy symbol of the newcomer can be calculated. 

2) Download the redundancy symbol from the previous 
node in the sequence, following the same circulant 
scheme. Solving an easy system of equations, the data 
symbol of the newcomer can be obtained. 

When the repair problem is faced, it is clear that quasi-cyclic 
MSR codes achieve bandwidth optimality for parameters d = 
k + 1 and B = n, by equations (Q]l and (01. Moreover, note 
that the failed node have to be regenerated by a specific subset 
of d = k + 1 nodes. 

C. Data Reconstruction 

As we have seen in the previous Subsection IIV-BI node 
regeneration achieves optimality according to the bounds 
given in Q). In this subsection, we will prove that the data 
reconstruction condition is also achieved and, as a conse- 
quence, quasi-cyclic MSR codes exist. In [26 1, we performed 
a computational search to claim the existence of quasi-cyclic 
MSR codes. Now, we will prove the existence of such codes 
theoretically. 

Let F s = (I S \M S ) denote the n x n submatrix of F 
determined by s — {ii, ...,«&} £ S. Let p s {c\, c 2 , . . . , c k ) £ 
Fq[ci, . . . , Ck] be the multivariate polynomial associated with 
the determinant of F s = (I S \M S ). 

Assume that a DC wants to obtain the file. Then, 
it connects to any k nodes {u^ , . . . , Vi k } and down- 
loads (dix) • • • , (GSi fc > fi k ). In order to obtain a = 
(oi, a 2 , . • • , a n ), we need F s to be full rank. Moreover, in 
order to satisfy the data reconstruction condition, we need F s 



to be full rank for all s G S. Therefore, we have to prove the 
following two statements: 

1) The polynomial associated with the determinant of F s 
is not identically zero. 

2) The polynomial associated with the determinant of F s 
is nonzero with high probability, for a random choose 
of the circulant coefficients (ci, . . . , Cfc). 

We will begin by proving the first statement. It is known 
that there exists a relation between determinants of matrices 
and bipartite graphs. Let G(VrU Vc, E) be the bipartite graph 
associated with a matrix F s , where each row of the matrix is 
represented by a vertex vri in Vr, and each column of F 3 
is represented by a vertex vci in Vc, where i = l,...,n. 
Two vertices vri £ Vr, vci £ Vc are adjacent if the entry 
in the row i and column j of F s is nonzero. Moreover, the 
weight of this edge is the nonzero value of the i, jth entry. 
Let T(vci) (resp. r(ur,)) denote the neighbors of wc, (resp. 
vri) in the graph G, and let Y(T) denote the set {j™ =1 T(U), 
where T = {t\, t 2 , . . . , t m } C Vc (resp. Vr). 

Lemma 1 ( 0311 ). The polynomial associated with the determi- 
nant of F s is not identically zero if and only if its associated 
bipartite graph G(Vr U Vc, E) has a perfect matching. 

To prove the existence of a perfect matching in the bipartite 
graph G(Vr U Vc, E) associated with F s , we will use Hall's 
theorem. 

Lemma 2 ( 11321 ). A bipartite graph G with vertex sets Vr and 
Vc contains a complete matching from Vr to Vc (resp. Vc to 
Vr) if and only if it satisfies Hall's condition for any T C Vc 
(resp. T C Vr), 

\T\ < \T(T)\. 

Moreover, if \Vr\ = \Vc\, the complete matching corresponds 
to a perfect matching. 

Lemma 3. Let T C Vc such that T ^ 0, then \T(T)\ > \T\. 

Proof: Note that Vc has k vertices of degree 1 and k 
vertices of degree k. We can decompose T = T\ U T%, where 
T\ contains vertices of degree 1 and contains vertices of 
degree k. It is clear that |r(Ti)| = |Ti| by construction, and it 
is easy to see that |r(T 2 )| > fc+|T 2 |-l > \T 2 \ by the circular 
construction of matrix M and because k > 1. Therefore, we 
can assume that Ti ^ and T 2 ^ 0. 

If |Ti| < k- 1, since |r(T x ) n T(T 2 )\ < |r(T x )| = |T X |, 
we have that \T(T)\ = |T(Ti)| + |r(T 2 )| - |T(Ti) n T(T 2 )\ > 
\Ti\ + k+ \T 2 \ - 1 - |Ti| > |Ti| + |T 2 | = \T\. On the other 
hand, if |T X | = k, then |T(Ti) n r(T 2 )| < — 1 since for 
each different vertex Uj £ T 2 , there exists a different vertex 
Vj i £ T\ such that T(i>i) D r(«j- 4 ) = 0. Thus, we also have 

that |r(T)| = |r(Ti)| + |r(T 2 )| - |r(T x ) n r(T 2 )| > 1^1 + 

k+ \T 2 \ - 1 - |Ti| + 1 > |T X | + |T 2 | = \T\. M 

Proposition 4. The polynomial associated with the determi- 
nant of F s is not identically zero. 

Proof: Since \Vr\ = \Vc\ and using Lemma [2] and 
Lemma[3j the bipartite graph G(Vr U Vc, E) associated with 
F s has a perfect matching. Finally, by Lemma Q] we know 
that F s is not identically zero. ■ 
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Figure 4. Distribution of symbols using a [6, 3, 4] quasi-cyclic MSR code over F5 with coefficients (ci, C2, C3) = (1, 1, 2). 



For the second statement, we have to prove that for a 
random choose of the circulant coefficients (ci, . . . , c„), the 
multiplication of all the multivariate polynomials associated 
with the determinant of all matrices F s ,s G S, is nonzero 
with high probability. 

Let q(ci,...,Ck) be the multivariate polynomial 
g(ci,...,Cfc) = IL G sPs( c i>---> c fc)- Note that if 
q(ci, . . . , c fe ) ^ 0, then p s (ci, . . . , c k ) ^ for all s e S. 

Lemma 5. The degree of q{c\, . . . ,Ck) is less than or equal 
to k(2). Formally, deg(q(ci, . . . , c fe )) < fc(£). 

Proof: Each Ci, i = 1, . . . , k, can appear a maximum of k 
times in F s . By Lagrange minor's theorem, p s (ci, . . . , Ck) has 
a maximum degree of k. By the definition of q(c\, . . . , C&), 
deg(g(ci,...,c fc )) < fc(fe). ■ 

Theorem 6. 77ze (^) submatrices F s , s £ S, are /mz7 ranfc 
vvz'f/z /zz'g/z probability for a sufficiently large finite field ¥ q . 

Proof: Knowing Corollary [4] and using the Schwartz- 
Zippel lemma [33 1, we know that 

deg(q(ci, . . . ,c k )) 



'1 

dcg(g(ci, 



Oh)) 



And 



Pr(o( Cl ,...,c fe ) = 0) < 

Therefore, Pr(q(ci, . . . ,c k ) ^ 0) > 1 
by Lemma|5] we know that deg(q(ci, . . . , c&)) < &(?), so for 
a sufficiently large field size q, submatrices F s , s 6 S, are 
full rank with high probability. ■ 

Summarizing, there is a set of full rank matrices F s , s £ 
S, for a random choose of the circulant coefficients and a 
sufficiently large field. This means that there exists such F 
that represents a quasi-cyclic MSR code. 

D. Example 

In this subsection, we will describe the construction of a 
[6,3,4] quasi-cyclic MSR code over F 5 . 

First, the file is fragmented into B = n = 6 data symbols 
a = (ai, <22, . . . , do). Then, each a» for i — 1,...,6 is 
stored in a node Vi = (aj,fj), along with its corresponding 
redundancy symbol r i7 which is computed using a circulant 
matrix F of the following form: 



F = 



( 1 


























C3 


C2 


Cl 


\ 





1 














Cl 











C3 


C2 










1 











C2 


Cl 











C3 













1 








C3 


c 2 


Cl 

























1 








C3 


c 2 


Cl 










V o 














1 








C3 


c 2 


Cl 





/ 



(7) 



By construction, the node regenerating condition is always 
achieved. However, in order to satisfy the data reconstruction 
condition, we need to find nonzero coefficients (ci,C2,c 3 ) 
such that q(ci,C2,C3) ^ over F5. Since q(ci, 02,03) = 
cl 4 4 2 4(~4 + cic 3 ) 5 (4 + c?)(-cicj + c 2 2 c 3 )(-4 - cl), a 
possible solution over F5 is (01,02,03) = (1,1,2). Figure |4] 
shows the distribution of data and redundancy symbols in the 
nodes using these circulant coefficients. 

Using the same argument, it is also possible to construct 
a [6,3,4] quasi-cyclic MSR code over Fg with circulant 
coefficients (01,02,03) = (1,1, 2), where z is a primitive 
element. Note that there is not any [6,3,4] quasi-cyclic MSR 
code over F 2 , F 3 , F4 and F7. 

V. Regenerating codes with optimal bandwidth 

In this section, we will present a new construction to create 
regenerating codes from other regenerating codes, called base 
regenerating codes. The idea is that the new codes minimize 
a subject to having the minimum 7. We will see that if a 
MSR code is used as the base code, for a specific choice 
of the parameters involved, the constructed codes are MBR 
codes, so they satisfy the optimal equations for the MBR point. 
Moreover, for other parameters, it is possible to decrease a 
and 7, which results in an improvement of the fundamental 
tradeoff curve for regenerating codes. 

The construction is based on graphs and generalizes a 
construction used in [23 1 and [24 1, where MDS codes are used 
as base codes to construct MBR codes. 

A. Code Construction 

Let C be a [n,k,d] regenerating code with storage nodes 
V = {vi, V2, ■ ■ ■ , v„}, where any subset of size k is enough 
to reconstruct the file. Let C be a [n, k, d) regenerating code 
with storage nodes W = {^1,1^2, . . . ,w ft }, where any subset 
of size k is enough to reconstruct the file. Now, we will show 
how to construct the new regenerating code C from the base 
code C. 

Lemma 7. Given n > 3, there exists a simple, undirected 
and d-regular graph H(W, V), where W is the set of vertices 
and V is the set of edges, satisfying the following conditions: 

\W\ = n, \V\ = n, 1 < d < n, and dn = 2n. 

Proof: Condition dn = 2n is given by the Handshaking 
lemma for a simple, undirected, and d-regular graph. By 
Erdos-Gallai degree sequence theorem, for d > 1 and d < n, 
there exists at least a simple, undirected, and d-regular graph 
such that dn = 2n. ■ 
Note that for d = ft — 1, we have the complete graph H = 
K Tl already presented in [24], and for d — 2 we have the cycle 
graph H = C n . 
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The set V = {vi,V2, ■ ■ ■ , v n } of storage nodes of the base 
regenerating code C can be seen as the set of edges of a 
simple, undirected, and d-regular graph H (W, V) of order n 
and size n which satisfies that 1 < d < n and dn = 2n by 
Lemma [7] Note that the set of vertices W can be determined 
by the set of edges V and the degree d. The set W represents 
the storage nodes in the new distributed storage system and 
every node u G W contains the d symbols determined by the 
d edges connected to w. Note that two vertices u>i,cjj G W 
are adjacent if, and only if, the corresponding storage nodes 
U!i,ujj share the same symbol in V. 

The graph H(W, V) can be transformed into a bipartite 
graph G(V U W, £), where V and W are the two disjoint 
sets of vertices, and £ is the set of edges. An edge between 
oji and Vj means that the storage node u>i stores the symbol 
Vj . Therefore, the vertices in V have degree 2 and the ones in 
W have degree d. Both graphs can be used without distinction 
to represent the construction of these new regenerating codes. 
However, some properties will be easier to prove using G than 
using H, so we will use both from now on. 

Assume that (3 = 1, so without loss of generality the n 
symbols in V will be considered as n bits. Let T(u>i) be the 
set of neighbors of the node oji G W. Note that in G(VUW, £), 
\T(u>i)\ = d means that the storage node stores d symbols, 
whereas in H(W, V), |r(o>j)| = d means that w,; has d helper 
nodes. 

Let S be the set of all subsets of {1, . . . , n} of size k > 1 
and let s G S. In G(V UW,£), the number of neighbors of a 
subset of k vertices in W is | \J ies T(wi)\ = J2ies \^( w i)\ ~ 
p, where p represents the intersection terms in the inclusion- 
exclusion formula. Since each vertex v G V has degree 2, 
for s' C s of size \s'\ > 2, | flies' r ( w 0l = and 9 = 
E*<j, ij 6S |rK) nr^)!. Note that ir^) n T( Wj )\ < 1 
for i, j G {1, . . . , n} and i =/= j, and p < (^)- 

Lemma 8. A [n, fc, d] regenerating code C can be constructed 
from a [n,k, d] regenerating code C if dn = In, 1 < d < k < 
kd — p, and 1 < k < n. 

Proof: Given a file distributed using C, we know that 
there are n nodes and that any k of those n nodes are enough to 
reconstruct the file. If the n symbols {v\, t>2, . . . , v n } represent 
the edges of H(W, V), by Lemma|7]we know that there exists 
a set of n vertices W such that it is possible to construct 
H(W, V) with 1 < d < n, and dfi = In. 

Given G(V U W,£), in order to achieve the regenerating 
condition, it can be seen that 7 = d < M < k is enough. The 
condition 1 < k < n is trivial. Finally, in order to achieve the 
data reconstruction condition, any subset of k nodes must store 
at least k symbols from C, so k < [j ieg r(wj)| = kd — p, 
which is enough with 1 < |s| = k < n. 

If those conditions are achieved, then H(W, V) can be 
constructed and C is a regenerating code. ■ 

As the goal is to minimize a subject to having the minimum 
7, using the above construction, we are interested in base 
regenerating codes which already have a minimum a. It is well 
known that the codes that have a minimum a are the MSR 
codes. Therefore, we will assume that the base regenerating 
codes are MSR codes. 



RC with 

MSR code optimal 

bandwidth 




Figure 5. A bipartite graph G(V U W,£) with d = 3, which represents 
the construction of a regenerating code with optimal bandwidth from a MSR 
code. 

In the following subsections, we will show for which pa- 
rameters n, k, d, n, k and d there exists a [n, k, d] regenerating 
code constructed from a [n,k,d] MSR code. We will also 
prove that equations Jmbr = ol M br = 2 kd-J+k are 
achieved and in some cases improved. These new regenerating 
codes have an optimal bandwidth because given a, it is 
impossible to regenerate one node using less bandwidth. 

To illustrate the construction, in Figure |5j a bipartite graph 
G(V U W, £) with d = 3 is shown. Each storage node 
of the MSR code represents one vertex Vj G V, and each 
storage node of the regenerating code with optimal bandwidth 
represents one vertex cjj G W. In other words, regenerating 
codes with optimal bandwidth are constructed from MSR 
codes using bipartite graphs. 

B. Node Regeneration 

Assume that a storage node fails, which is the same as 
erasing one vertex Ui G W in H(W,V). The newcomer oj 
can replace u)i by downloading and storing one symbol from 
each one of its d neighbors. 

Note that regenerating codes with optimal bandwidth use ex- 
act repair and that the node regeneration is done by a property, 
called repair-by-transfer, which means that each newcomer 
will download not only the same amount of information that it 
has to store, but also exactly the same symbols. Regenerating 
codes with the repair-by-transfer property are very interesting 
because no computational operations are neither needed in the 
helper nodes nor in the newcomer. 

It is easy to see that a node with a bits can not be 
regenerated by less than a bits, so 7 > a. Therefore, the 
minimum possible bandwidth 7 needed to regenerate a node 
is the amount of stored data a in the failed node. Regenerating 
codes with optimal bandwidth achieve the minimum, so they 
satisfy that a = 7. Moreover, as we have assumed that j3 = 1, 
we can say that the minimum amount of bandwidth is achieved 
when the number of symbols stored per node, A, is equal to 
the number of downloaded symbols d, so A = d. 



x 



In the general framework of regenerating codes, when 7 



2Md 



" — ~zkdZTkT+k' we are m me MBR point of the fundamental 
tradeoff curve. In this case, without loss of generality, we can 
say that 7 = a = Ar = dr = 2 kd-k^+k • ^ e varue of a = 7 
for regenerating codes with optimal bandwidth coincides in 
some cases with the general framework bound for MBR codes, 
and in some other cases it is lower. In the next subsection, we 
are going to find the minimum a subject to 7 = a. 



C. Data Reconstruction 

In this subsection, we will find the minimum a for [n, fc, d] 
regenerating codes with optimal bandwidth. In G(V U W,£), 
it can be seen that d = \T(u>i)\ for all i = 1, ...,n. To 
reconstruct the file, any subset of fc vertices in W must be 
adjacent to a subset of at least k vertices in V. 

Lemma 9. Fixed k > 1, the minimum a is achieved when 
k = kd — p. 

Proof: By Lemma [8] we know that k < kd — p. Since 
f3 = 1 and 7 = d = a, the minimum d will lead to the 
minimum a 

k = kd — p. 

As we are trying to minimize a, we assume the equality 
k = kd — p given in Lemma [9] and we establish an upper 
bound for the parameter p in Proposition [10] 



[6, 3, 4] quasi-cyclic MSR code 



As d > the minimum d is achieved when 



Proposition 10. In the bipartite graph G(VL)W,£) with k = 
kd — p, we have that 

1) p<(]) ifk<d+l, 

2) P < L^J(T) + ( k mod 2 (d+1) ) ifk>d + l. 

Proof: Each node Ui £ W has d edges, so fc nodes have 
kd edges. Now, we will distinguish two cases. 

Case k < d + 1: In H (W, V), each vertex tOj shares one, 
and only one, edge with another vertex 00 j. Each vertex taj, 
i 6 s and |s| = k, can share a maximum of one edge with each 
one of the other vertices 6 s, i ^ j. Then, the maximum 
number of shared edges is (l\ . In other words, when fc < d+l, 
it is possible to create a complete subgraph of fc vertices in 
H(W, V) with (2) edges. 

Case k > d + 1: Given H (W, V) and s, we are going to 
construct a subgraph which maximizes the number of shared 
edges. Each vertex Ui, i S s, can share a maximum of d 
edges with the remaining vertices u>j,j G s, i ^ j. Therefore, 
the maximum number of shared edges is when we consider a 
complete subgraph with d+l vertices and edges. As k > 
d+1, there could be L^[J complete subgraphs, each one with 
( d ~^ 1 ) edges. The vertices out of these complete subgraphs can 
share a maximum of ( fc mod 2 edges, which leads to the 

upper bound p < [J^\ + ( fc mod 2 ^ d+1 '>) for k > d+ 1. 

■ 

Now, we are going to analyze the amount of data stored per 
node a for each one of the cases given in Proposition [10] 

1) Case k < d+l: Depending on the set of parameters 
{n, k, d, n, k, d], it is possible to construct a [n, k, d] regener- 
ating code with optimal bandwidth from a [n, k, d] MSR code 
for k < d + l following Lemma [8] Moreover, the maximum 
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[6, 2, 2] regenerating code with optimal bandwidth 



Figure 6. Construction of a [6, 2, 2] regenerating code with optimal 
bandwidth from a [6, 3, 4] quasi-cyclic MSR code. 



a = 7 achieved using this construction gives exactly the same 
optimal parameter (Xmbb. = Imbr as shown in subsection 

Theorem 11. For k < d + 1, f/ze amount of data stored per 
node in a regenerating code with optimal bandwidth, a — d=c, 

is upperbounded by cumbr = ^lJ^tttt: > tne one claimed as 
optimal in subsection Ml- PA or , 



2kd-k 2 +k' 

SB- 



Proof: By construction, it can be seen that each node 
u>i € W in a regenerating code with optimal bandwidth stores 
d symbols of size 4p because the base code is a MSR code. 
The total storage per node is a = d¥-. 

By Proposition [10] we also know that k > kd — • The 
maximum a is given when k is minimum, which means that 
k = kd— (2) • Replacing k in equation a = d^- with k = kd— 

(2), we obtain that a is upperbounded by (Xmbr = 2kd-k?+k ' 

■ 

Figure [6] shows an example of a regenerating code with 
optimal bandwidth for k < d+l. In this particular example, a 
quasi-cyclic MSR code is used as a base code to construct the 
regenerating code with optimal bandwidth. Each node € W 
can be repaired downloading half node uii-i and half node 
tOj+i. Moreover, any k — 2 nodes in C contain at least k = 3 
different nodes in C, which allow us to reconstruct the file. 
Note that a = |M = aMBR- 

2) Case k > d+l: Depending on the set of parameters 
{n, fc, d, n,k,d\, it is possible to construct a [n, fc, d] regener- 
ating code with optimal bandwidth from a [n, fc, d] MSR code 
for fc > d + 1 following Lemma [8] Moreover, the maximum 
a = 7 achieved using this construction is less than the optimal 
parameter aMBR = Imbr of the MBR codes shown in 
subsection IH-AI 

Note that according to Q], where flow graphs are used to 
prove the optimal tradeoff curve between a and 7, requiring 
any d storage nodes to have a flow of M will lead to the same 
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[10, 5, 6] quasi-cyclic MSR code 













-2 + 5"3 + "3 + 5-4 + "4 + 5<*5 + "5 + 5»6 + »6 + 5 °7 + "7 + 5a 8 + 
2a 4 4-a 5 4-a 6 2a s +a 6 + a 7 2ag + a 7 + a g 2a 7 + ag + ag 2a g +ag + a 10 2ag+a 10 + a 1 


ag + Sag + 
2a 1Q + ai + a 2 


ag + 5a 10 + 
2ai +a 2 +a 3 


"10 + 5-1 + 
2a 2 +a 3 +a 4 


■1 + 5a 2 + 
2a 3 +a 4 + a 6 



Q l a 2 a 3 a 4 a 5 a 6 a 7 a 8 a 9 a 10 



a 2 + 5a 3 + 
2a 4 + a 5 + ag 


a 3 + 5a 4 + 
2a 5 +a 6 + a 7 


a 4 
2a 6 


+ 5a 5 + 
+ a 7 + a 8 


a 5 
2„ 7 


+ 5a 6 + 
+ a 8 + ag 


a 6 + 5a 7 + 
2a 8 +ag + a 10 


a 7 + 5a 8 + 
2ag + a 1 n + a 1 


a 8 + Sag + 
2a 10 +a 1 + a 2 


ag + 5a 10 
2 01 + a 2 + 


+ 


a 10 + Sax 
2a 2 +a 3 + 


+ 
a 4 


-1 + 5a 2 + 
2a 3 +a 4 +a 5 




a 2 


a 3 




a 4 




a 5 


a 6 


a 7 


" 8 


ag 




a 10 




a l 


a 3 + 5a 4 + 
2a 6 +a 6 + „ 7 


a 4 + 5a 5 + 
2a 6 + a 7 + ag 


B| 

2a 7 


+ 5a 6 + 
+ a 8 + ag 


a 6 
2„ s 


+ 5a 7 + 
+ ag + a 10 


a 7 + 5a 8 + 
2ag+a 10 + a 1 


a 8 + 5ag + 
2a 1Q +a 1 + a 2 


ag + 5a 10 + 
2«1 +a 2 + a 3 


a 10 + 5oj 
2a 2 + a 3 + 


+ 
a 4 


aj + 5a 2 
2a 3 + a 4 + 


+ 
»5 


a 2 + 5a 3 + 
2a 4 + a 5 + ag 



[10, 4, 2] regenerating code with optimal bandwidth 



Figure 7. Construction of a [10, 4, 2] regenerating code with optimal bandwidth from a [10, 5, 6] quasi-cyclic MSR code. 



condition as requiring any k storage nodes to have a flow of Theorem 14. For k > d + 1, f/ie amount of data stored per 



2Md 



2kd~k A +k 



becomes 



M, so when d < k equation olmbr = 

_ 2Mk 
OiMBR - 2dk-d'i+d- 

Proposition 12. For k > d+1, the amount of data stored per 
node in a regenerating code with optimal bandwidth is 

M 



-M - 
a = d — < d 
k 



w-(Lakli 



Ik mod 
I 2 



)) 



Proof: The result follows taking into account that each 
symbol of a MSR code has size and using that kd> k> 

w-(L 3 |rJft 1 ) + ( s mod 2 (d " + % ' ■ 

When fe < d + 1, we have seen that the minimum value of 
a also corresponds to the MBR point. However, as we will 
show in Theorem [Pfl if k > d+ 1, it is possible to decrease a 
while maintaining a = 7, which improves the tradeoff curve 
for regenerating codes. 

Lemma 13. For k > d + 1 > 2, 

> 



2dfc - d 2 + d fed - /(fc) ' 

wh eref(k)=l^]( d t 1 ) + ( k ™\ (d+1) ). 

Proof: Since fe > d + 1 > 0, it is enough to prove that 

kd kd 



d 2 -d 3 



> 



k 2 d-kf{k)' 



< k 2 d~kf(k). Fixed d, we define 



d?k H 

or equivalently d 2 k+ 
X(k) = d 2 k + and Z(k) = k 2 d - kf(k). 

First, we will prove that for k > d+ 1 > 2, f(k) < Let 
= g(d + 1) + r where < r < d. If r = 0, then it is easy to 
see that f(k) = . If r > 0, then f(k) = + < 

g(d + l)| + r| = fcf. 

Let F(fc) = fc 2 d - fc 2 | = A; 2 g. Note that F(fc) < Z(k), 
and it is easy to see that X(k) < Y(k) for k > 2(d + 1). So 
in this case, X(k) < Z(k). 

For d + 1 < k < 2(d + 1), let k = d + 1 + t with < 
t < d + 1. We have that Z(fc) = fc 2 ri - fc^ 1 ) + (2)) = 

2k*d-k(d+l)d-kt(t-l) ^ h ^ be ^ ^ x(fc) < 



noc/e /n a regenerating code with optimal bandwidth, a = d j, 

j's /esi f/zan olmbr — 2dk M <i 2 +d ' ^ e one c ^ a ' mee ^ as optimal 
in subsection \II-A\ or HJjl. 



Proof: By Proposition [121 and Lemma 1131 we know that 

2k d 
2dk-d 2 +d > M-l^i^)-^™^) S ° a< aMBR - " 

Figure |7] shows an example of a regenerating code with 
optimal bandwidth for k > d+ 1. Note that the MSR code 
used as the base code is in fact a quasi-cyclic MSR code. 
Each node Wj s W can be repaired downloading half node 
u>i-i and half node Wj+i. Moreover, any fc = 4 nodes in (7 
contain at least k = 5 different nodes in C, which allows us 
to reconstruct the file. Note that a — |M < |M = Ombr- 

D. Regenerating quasi-cyclic codes with optimal bandwidth 

It is possible to use quasi-cyclic MSR codes as base 
regenerating codes to create regenerating codes with optimal 
bandwidth, using the above construction. In fact, the regener- 
ating codes with optimal bandwidth shown in Figures [6] and 
[7] are quasi-cyclic regenerating codes with optimal bandwidth. 
In this subsection, we will analyze the resulting parameters of 
these quasi-cyclic regenerating codes with optimal bandwidth. 

Proposition 15. For k < d+1, there exists a [n, k, d] quasi- 
cyclic regenerating code with optimal bandwidth constructed 
from a [2k, k,k + 1] quasi-cyclic MSR code if the set of 
parameters fc,d} achieve: 



k = 



k(2d-k + l) 



2k(2d-k+l) 

n= d ' 

1 < d < k. 

Proof: The solution of the system composed by k in the 
worst scenario k = kd — u) , n = 4p and 1 < d < k leads to 
these three equations. ■ 

Proposition 16. For k > d + 1, there exists a [n, k, d] quasi- 
cyclic regenerating code with optimal bandwidth constructed 
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Figure 8. Parameters [n, fc, d] for some quasi-cyclic regenerating codes with 
optimal bandwidth constructed from a [n, k, d] quasi-cyclic MSR code and 
with optimal parameter a = 7, which is compared with cxmbr = 1MBR- 



from a [2k, k,k + 1] quasi-cyclic MSR code if the set of 
parameters {fc, n, k, d\ achieves: 



k = kd ■ 



k 



d+1 



d+1 
2 



k mod (d + 1 
2 



n 



2n 

T' 
1 < J < fc. 

Proof: Straightforward from properties of quasi-cyclic 
MSR codes and Proposition [TOl ■ 

Figure [S] shows the parameters of some quasi-cyclic re- 
generating codes with optimal bandwidth. The first column 
shows the parameters [n, k, d] of the quasi-cyclic regenerating 
codes with optimal bandwidth. The second column shows the 
parameters [n, k, d] of the corresponding quasi-cyclic MSR 
codes. The third and forth columns compare the parameter 
a = 7 for the optimal one as stated in [ 1 J and the one achieved 
by the quasi-cyclic regenerating code with optimal bandwidth. 
First part of the table shows cases when k < d+1, and the 
second part cases when k > d + 1 

The existence of quasi-cyclic regenerating codes with opti- 
mal bandwidth proves that the quasi-cyclic regenerating codes 
are a complete family of regenerating codes, which can be 
constructed for both MSR and MBR points of the tradeoff 
curve. However, note that the construction presented in the 
previous section is independent to the base code chosen as 
long as the base regenerating code is a MSR code. 

VI. Conclusions 

In this paper, we provide some interesting contributions 
to the current state of the art of erasure codes applied to 
distributed storage. We construct a complete family of regen- 
erating codes using quasi-cyclic codes. 

At the MSR point, we provide an exact repair solution for all 
parameters achieving d = k + 1 and n = 2k. This construction 
is optimal according to the MSR point in the fundamental 
tradeoff curve. Moreover, quasi-cyclic MSR codes have a very 
simple regenerating algorithm that approaches to the repair- 
by-transfer property. In our solution, the helper nodes do not 
need to do any linear combination among their symbols. The 
only linear combination is done in the newcomer to obtain the 
symbols the first time that enters into the system. As far as 



we are concerned, this is the first construction achieving this 
repair simplicity for the MSR point. We also claim that such 
codes exist with high probability. 

If we want to achieve bandwidth optimality, we give a 
construction which using graphs and an existing MSR regener- 
ating code produces a code with minimum bandwidth. When 
k < d + 1, we provide an exact repair-by-transfer solution. 
This construction has the minimum possible bandwidth 7 = 0? 
and we claim that it is the same value as the one shown to 
be optimal according to the MBR point in the fundamental 
tradeoff curve. Moreover, when k > d+1, we also provide an 
exact repair-by-transfer solution and we show that it has the 
minimum possible bandwidth 7 = a. Moreover, we claim that 
this value is less than the one shown to be optimal according 
to the MBR point, which means that our construction is more 
efficient in terms of a and 7. This is possible because we relax 
the condition about the specific subset of helper nodes used. 

Finally, we show that quasi-cyclic MSR codes can be used 
as base codes to construct the regenerating quasi-cyclic codes 
with optimal bandwidth. We provide the conditions needed on 
the parameters {n, k, d, n, k, d} for both cases, when k < d+1 
and when k > d+1. 

Summarizing, this paper provides a construction of efficient, 
simple and optimal MSR codes at the rate of r = 1/2, and 
a construction of efficient and simple codes with minimum 
bandwidth that improves the optimal bound for regenerating 
codes by relaxing the condition about the subset of helper 
nodes. 
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